Dataset statistics
| Number of variables | 26 |
|---|---|
| Number of observations | 87020 |
| Missing cells | 260465 |
| Missing cells (%) | 11.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 78.5 MiB |
| Average record size in memory | 946.4 B |
Variable types
| CAT | 11 |
|---|---|
| NUM | 11 |
| BOOL | 4 |
City has a high cardinality: 697 distinct values | High cardinality |
DOB has a high cardinality: 11345 distinct values | High cardinality |
Lead_Creation_Date has a high cardinality: 92 distinct values | High cardinality |
Employer_Name has a high cardinality: 43567 distinct values | High cardinality |
Salary_Account has a high cardinality: 57 distinct values | High cardinality |
EMI_Loan_Submitted is highly correlated with Loan_Amount_Submitted | High correlation |
Loan_Amount_Submitted is highly correlated with EMI_Loan_Submitted | High correlation |
City has 1003 (1.2%) missing values | Missing |
Salary_Account has 11764 (13.5%) missing values | Missing |
Loan_Amount_Submitted has 34613 (39.8%) missing values | Missing |
Loan_Tenure_Submitted has 34613 (39.8%) missing values | Missing |
Interest_Rate has 59294 (68.1%) missing values | Missing |
Processing_Fee has 59600 (68.5%) missing values | Missing |
EMI_Loan_Submitted has 59294 (68.1%) missing values | Missing |
Monthly_Income is highly skewed (γ1 = 167.5605262) | Skewed |
Existing_EMI is highly skewed (γ1 = 211.7693511) | Skewed |
ID has unique values | Unique |
Loan_Amount_Applied has 28853 (33.2%) zeros | Zeros |
Loan_Tenure_Applied has 33844 (38.9%) zeros | Zeros |
Existing_EMI has 58238 (66.9%) zeros | Zeros |
Var5 has 29087 (33.4%) zeros | Zeros |
Var4 has 2546 (2.9%) zeros | Zeros |
Reproduction
| Analysis started | 2020-09-25 13:25:50.047968 |
|---|---|
| Analysis finished | 2020-09-25 13:26:11.343439 |
| Duration | 21.3 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 87020 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 680.0 KiB |
| ID102986A10 | 1 |
|---|---|
| ID050808E30 | 1 |
| ID122562Y20 | 1 |
| ID059001H10 | 1 |
| ID067100U00 | 1 |
| Other values (87015) |
| Value | Count | Frequency (%) | |
| ID102986A10 | 1 | < 0.1% | |
| ID050808E30 | 1 | < 0.1% | |
| ID122562Y20 | 1 | < 0.1% | |
| ID059001H10 | 1 | < 0.1% | |
| ID067100U00 | 1 | < 0.1% | |
| ID056239B40 | 1 | < 0.1% | |
| ID016151F10 | 1 | < 0.1% | |
| ID096756K10 | 1 | < 0.1% | |
| ID078931V10 | 1 | < 0.1% | |
| ID086086A10 | 1 | < 0.1% | |
| ID025475V00 | 1 | < 0.1% | |
| ID104727Z20 | 1 | < 0.1% | |
| ID055224A40 | 1 | < 0.1% | |
| ID029214Q40 | 1 | < 0.1% | |
| ID117978Q30 | 1 | < 0.1% | |
| ID117695T00 | 1 | < 0.1% | |
| ID073500Y00 | 1 | < 0.1% | |
| ID039113J30 | 1 | < 0.1% | |
| ID034692I20 | 1 | < 0.1% | |
| ID051416O10 | 1 | < 0.1% | |
| ID090308K30 | 1 | < 0.1% | |
| ID019197J20 | 1 | < 0.1% | |
| ID017587L20 | 1 | < 0.1% | |
| ID080208Y30 | 1 | < 0.1% | |
| ID039895L00 | 1 | < 0.1% | |
| Other values (86995) | 86995 | > 99.9% |
Unique
| Unique | 87020 ? |
|---|---|
| Unique (%) | 100.0% |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 223276 | 23.3% | |
| I | 90405 | 9.4% | |
| D | 90380 | 9.4% | |
| 1 | 83931 | 8.8% | |
| 2 | 62612 | 6.5% | |
| 3 | 59627 | 6.2% | |
| 4 | 59586 | 6.2% | |
| 8 | 41554 | 4.3% | |
| 6 | 41547 | 4.3% | |
| 5 | 41440 | 4.3% | |
| 9 | 41372 | 4.3% | |
| 7 | 41215 | 4.3% | |
| V | 3417 | 0.4% | |
| J | 3387 | 0.4% | |
| Y | 3379 | 0.4% | |
| T | 3374 | 0.4% | |
| S | 3367 | 0.4% | |
| E | 3366 | 0.4% | |
| B | 3355 | 0.4% | |
| C | 3350 | 0.3% | |
| W | 3347 | 0.3% | |
| N | 3346 | 0.3% | |
| K | 3345 | 0.3% | |
| G | 3342 | 0.3% | |
| U | 3339 | 0.3% | |
| Other values (11) | 36561 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 696160 | 72.7% | |
| Uppercase Letter | 261060 | 27.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| I | 90405 | 34.6% | |
| D | 90380 | 34.6% | |
| V | 3417 | 1.3% | |
| J | 3387 | 1.3% | |
| Y | 3379 | 1.3% | |
| T | 3374 | 1.3% | |
| S | 3367 | 1.3% | |
| E | 3366 | 1.3% | |
| B | 3355 | 1.3% | |
| C | 3350 | 1.3% | |
| W | 3347 | 1.3% | |
| N | 3346 | 1.3% | |
| K | 3345 | 1.3% | |
| G | 3342 | 1.3% | |
| U | 3339 | 1.3% | |
| A | 3339 | 1.3% | |
| F | 3337 | 1.3% | |
| P | 3337 | 1.3% | |
| Q | 3335 | 1.3% | |
| O | 3334 | 1.3% | |
| H | 3330 | 1.3% | |
| R | 3323 | 1.3% | |
| M | 3319 | 1.3% | |
| L | 3307 | 1.3% | |
| X | 3301 | 1.3% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 223276 | 32.1% | |
| 1 | 83931 | 12.1% | |
| 2 | 62612 | 9.0% | |
| 3 | 59627 | 8.6% | |
| 4 | 59586 | 8.6% | |
| 8 | 41554 | 6.0% | |
| 6 | 41547 | 6.0% | |
| 5 | 41440 | 6.0% | |
| 9 | 41372 | 5.9% | |
| 7 | 41215 | 5.9% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 696160 | 72.7% | |
| Latin | 261060 | 27.3% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| I | 90405 | 34.6% | |
| D | 90380 | 34.6% | |
| V | 3417 | 1.3% | |
| J | 3387 | 1.3% | |
| Y | 3379 | 1.3% | |
| T | 3374 | 1.3% | |
| S | 3367 | 1.3% | |
| E | 3366 | 1.3% | |
| B | 3355 | 1.3% | |
| C | 3350 | 1.3% | |
| W | 3347 | 1.3% | |
| N | 3346 | 1.3% | |
| K | 3345 | 1.3% | |
| G | 3342 | 1.3% | |
| U | 3339 | 1.3% | |
| A | 3339 | 1.3% | |
| F | 3337 | 1.3% | |
| P | 3337 | 1.3% | |
| Q | 3335 | 1.3% | |
| O | 3334 | 1.3% | |
| H | 3330 | 1.3% | |
| R | 3323 | 1.3% | |
| M | 3319 | 1.3% | |
| L | 3307 | 1.3% | |
| X | 3301 | 1.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 223276 | 32.1% | |
| 1 | 83931 | 12.1% | |
| 2 | 62612 | 9.0% | |
| 3 | 59627 | 8.6% | |
| 4 | 59586 | 8.6% | |
| 8 | 41554 | 6.0% | |
| 6 | 41547 | 6.0% | |
| 5 | 41440 | 6.0% | |
| 9 | 41372 | 5.9% | |
| 7 | 41215 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 957220 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 223276 | 23.3% | |
| I | 90405 | 9.4% | |
| D | 90380 | 9.4% | |
| 1 | 83931 | 8.8% | |
| 2 | 62612 | 6.5% | |
| 3 | 59627 | 6.2% | |
| 4 | 59586 | 6.2% | |
| 8 | 41554 | 4.3% | |
| 6 | 41547 | 4.3% | |
| 5 | 41440 | 4.3% | |
| 9 | 41372 | 4.3% | |
| 7 | 41215 | 4.3% | |
| V | 3417 | 0.4% | |
| J | 3387 | 0.4% | |
| Y | 3379 | 0.4% | |
| T | 3374 | 0.4% | |
| S | 3367 | 0.4% | |
| E | 3366 | 0.4% | |
| B | 3355 | 0.4% | |
| C | 3350 | 0.3% | |
| W | 3347 | 0.3% | |
| N | 3346 | 0.3% | |
| K | 3345 | 0.3% | |
| G | 3342 | 0.3% | |
| U | 3339 | 0.3% | |
| Other values (11) | 36561 | 3.8% |
Gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 680.0 KiB |
| Male | |
|---|---|
| Female |
| Value | Count | Frequency (%) | |
| Male | 49848 | 57.3% | |
| Female | 37172 | 42.7% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.854332337 |
| Min length | 4 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 124192 | 29.4% | |
| a | 87020 | 20.6% | |
| l | 87020 | 20.6% | |
| M | 49848 | 11.8% | |
| F | 37172 | 8.8% | |
| m | 37172 | 8.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 335404 | 79.4% | |
| Uppercase Letter | 87020 | 20.6% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| M | 49848 | 57.3% | |
| F | 37172 | 42.7% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 124192 | 37.0% | |
| a | 87020 | 25.9% | |
| l | 87020 | 25.9% | |
| m | 37172 | 11.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 422424 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 124192 | 29.4% | |
| a | 87020 | 20.6% | |
| l | 87020 | 20.6% | |
| M | 49848 | 11.8% | |
| F | 37172 | 8.8% | |
| m | 37172 | 8.8% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 422424 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 124192 | 29.4% | |
| a | 87020 | 20.6% | |
| l | 87020 | 20.6% | |
| M | 49848 | 11.8% | |
| F | 37172 | 8.8% | |
| m | 37172 | 8.8% |
| Distinct | 697 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 1003 |
| Missing (%) | 1.2% |
| Memory size | 680.0 KiB |
| Delhi | |
|---|---|
| Bengaluru | |
| Mumbai | |
| Hyderabad | |
| Chennai | |
| Other values (692) |
| Value | Count | Frequency (%) | |
| Delhi | 12527 | 14.4% | |
| Bengaluru | 10824 | 12.4% | |
| Mumbai | 10795 | 12.4% | |
| Hyderabad | 7272 | 8.4% | |
| Chennai | 6916 | 7.9% | |
| Pune | 5207 | 6.0% | |
| Kolkata | 2888 | 3.3% | |
| Ahmedabad | 1788 | 2.1% | |
| Jaipur | 1331 | 1.5% | |
| Gurgaon | 1212 | 1.4% | |
| Coimbatore | 1147 | 1.3% | |
| Thane | 905 | 1.0% | |
| Chandigarh | 870 | 1.0% | |
| Surat | 802 | 0.9% | |
| Visakhapatnam | 764 | 0.9% | |
| Indore | 734 | 0.8% | |
| Vadodara | 624 | 0.7% | |
| Nagpur | 594 | 0.7% | |
| Lucknow | 580 | 0.7% | |
| Ghaziabad | 560 | 0.6% | |
| Bhopal | 513 | 0.6% | |
| Kochi | 492 | 0.6% | |
| Patna | 461 | 0.5% | |
| Faridabad | 447 | 0.5% | |
| Madurai | 375 | 0.4% | |
| Other values (672) | 15389 | 17.7% | |
| (Missing) | 1003 | 1.2% |
Unique
| Unique | 79 ? |
|---|---|
| Unique (%) | 0.1% |
Length
| Max length | 24 |
|---|---|
| Median length | 7 |
| Mean length | 7.098724431 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 96001 | 15.5% | |
| e | 51277 | 8.3% | |
| u | 50691 | 8.2% | |
| n | 43901 | 7.1% | |
| i | 43710 | 7.1% | |
| r | 37734 | 6.1% | |
| h | 32815 | 5.3% | |
| l | 31413 | 5.1% | |
| d | 28554 | 4.6% | |
| b | 23642 | 3.8% | |
| m | 17143 | 2.8% | |
| g | 16817 | 2.7% | |
| o | 13620 | 2.2% | |
| D | 13356 | 2.2% | |
| B | 13050 | 2.1% | |
| M | 12290 | 2.0% | |
| t | 9412 | 1.5% | |
| C | 9239 | 1.5% | |
| y | 8310 | 1.3% | |
| H | 7961 | 1.3% | |
| P | 6498 | 1.1% | |
| p | 6384 | 1.0% | |
| k | 5840 | 0.9% | |
| K | 4810 | 0.8% | |
| A | 3359 | 0.5% | |
| Other values (31) | 29904 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 527187 | 85.3% | |
| Uppercase Letter | 88314 | 14.3% | |
| Space Separator | 1818 | 0.3% | |
| Decimal Number | 382 | 0.1% | |
| Other Punctuation | 17 | < 0.1% | |
| Dash Punctuation | 11 | < 0.1% | |
| Open Punctuation | 1 | < 0.1% | |
| Close Punctuation | 1 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| D | 13356 | 15.1% | |
| B | 13050 | 14.8% | |
| M | 12290 | 13.9% | |
| C | 9239 | 10.5% | |
| H | 7961 | 9.0% | |
| P | 6498 | 7.4% | |
| K | 4810 | 5.4% | |
| A | 3359 | 3.8% | |
| G | 3034 | 3.4% | |
| N | 2413 | 2.7% | |
| J | 2143 | 2.4% | |
| V | 2120 | 2.4% | |
| S | 1954 | 2.2% | |
| T | 1740 | 2.0% | |
| R | 1108 | 1.3% | |
| L | 910 | 1.0% | |
| I | 831 | 0.9% | |
| F | 516 | 0.6% | |
| E | 348 | 0.4% | |
| U | 291 | 0.3% | |
| W | 213 | 0.2% | |
| O | 77 | 0.1% | |
| Y | 53 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 96001 | 18.2% | |
| e | 51277 | 9.7% | |
| u | 50691 | 9.6% | |
| n | 43901 | 8.3% | |
| i | 43710 | 8.3% | |
| r | 37734 | 7.2% | |
| h | 32815 | 6.2% | |
| l | 31413 | 6.0% | |
| d | 28554 | 5.4% | |
| b | 23642 | 4.5% | |
| m | 17143 | 3.3% | |
| g | 16817 | 3.2% | |
| o | 13620 | 2.6% | |
| t | 9412 | 1.8% | |
| y | 8310 | 1.6% | |
| p | 6384 | 1.2% | |
| k | 5840 | 1.1% | |
| s | 3280 | 0.6% | |
| w | 2098 | 0.4% | |
| c | 2006 | 0.4% | |
| j | 939 | 0.2% | |
| z | 823 | 0.2% | |
| v | 684 | 0.1% | |
| f | 90 | < 0.1% | |
| x | 3 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 1818 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 191 | 50.0% | |
| 4 | 191 | 50.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| & | 16 | 94.1% | |
| . | 1 | 5.9% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 11 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 1 | 100.0% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 615501 | 99.6% | |
| Common | 2230 | 0.4% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 96001 | 15.6% | |
| e | 51277 | 8.3% | |
| u | 50691 | 8.2% | |
| n | 43901 | 7.1% | |
| i | 43710 | 7.1% | |
| r | 37734 | 6.1% | |
| h | 32815 | 5.3% | |
| l | 31413 | 5.1% | |
| d | 28554 | 4.6% | |
| b | 23642 | 3.8% | |
| m | 17143 | 2.8% | |
| g | 16817 | 2.7% | |
| o | 13620 | 2.2% | |
| D | 13356 | 2.2% | |
| B | 13050 | 2.1% | |
| M | 12290 | 2.0% | |
| t | 9412 | 1.5% | |
| C | 9239 | 1.5% | |
| y | 8310 | 1.4% | |
| H | 7961 | 1.3% | |
| P | 6498 | 1.1% | |
| p | 6384 | 1.0% | |
| k | 5840 | 0.9% | |
| K | 4810 | 0.8% | |
| A | 3359 | 0.5% | |
| Other values (23) | 27674 | 4.5% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1818 | 81.5% | ||
| 2 | 191 | 8.6% | |
| 4 | 191 | 8.6% | |
| & | 16 | 0.7% | |
| - | 11 | 0.5% | |
| ( | 1 | < 0.1% | |
| . | 1 | < 0.1% | |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 617731 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 96001 | 15.5% | |
| e | 51277 | 8.3% | |
| u | 50691 | 8.2% | |
| n | 43901 | 7.1% | |
| i | 43710 | 7.1% | |
| r | 37734 | 6.1% | |
| h | 32815 | 5.3% | |
| l | 31413 | 5.1% | |
| d | 28554 | 4.6% | |
| b | 23642 | 3.8% | |
| m | 17143 | 2.8% | |
| g | 16817 | 2.7% | |
| o | 13620 | 2.2% | |
| D | 13356 | 2.2% | |
| B | 13050 | 2.1% | |
| M | 12290 | 2.0% | |
| t | 9412 | 1.5% | |
| C | 9239 | 1.5% | |
| y | 8310 | 1.3% | |
| H | 7961 | 1.3% | |
| P | 6498 | 1.1% | |
| p | 6384 | 1.0% | |
| k | 5840 | 0.9% | |
| K | 4810 | 0.8% | |
| A | 3359 | 0.5% | |
| Other values (31) | 29904 | 4.8% |
| Distinct | 5825 |
|---|---|
| Distinct (%) | 6.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 58849.97435 |
|---|---|
| Minimum | 0 |
| Maximum | 444554443 |
| Zeros | 314 |
| Zeros (%) | 0.4% |
| Memory size | 680.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10000 |
| Q1 | 16500 |
| median | 25000 |
| Q3 | 40000 |
| 95-th percentile | 95000 |
| Maximum | 444554443 |
| Range | 444554443 |
| Interquartile range (IQR) | 23500 |
Descriptive statistics
| Standard deviation | 2177511.361 |
|---|---|
| Coefficient of variation (CV) | 37.0010588 |
| Kurtosis | 31361.57429 |
| Mean | 58849.97435 |
| Median Absolute Deviation (MAD) | 10000 |
| Skewness | 167.5605262 |
| Sum | 5121124768 |
| Variance | 4.741555729e+12 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 25000 | 5823 | 6.7% | |
| 20000 | 4523 | 5.2% | |
| 15000 | 4246 | 4.9% | |
| 30000 | 3216 | 3.7% | |
| 50000 | 2392 | 2.7% | |
| 18000 | 2140 | 2.5% | |
| 10000 | 2136 | 2.5% | |
| 12000 | 1895 | 2.2% | |
| 40000 | 1856 | 2.1% | |
| 35000 | 1798 | 2.1% | |
| 22000 | 1608 | 1.8% | |
| 16000 | 1529 | 1.8% | |
| 17000 | 1336 | 1.5% | |
| 23000 | 1146 | 1.3% | |
| 21000 | 1113 | 1.3% | |
| 45000 | 1034 | 1.2% | |
| 14000 | 1017 | 1.2% | |
| 13000 | 975 | 1.1% | |
| 32000 | 971 | 1.1% | |
| 28000 | 969 | 1.1% | |
| 100000 | 968 | 1.1% | |
| 60000 | 965 | 1.1% | |
| 24000 | 904 | 1.0% | |
| 27000 | 874 | 1.0% | |
| 26000 | 809 | 0.9% | |
| Other values (5800) | 40777 | 46.9% |
| Value | Count | Frequency (%) | |
| 0 | 314 | 0.4% | |
| 1 | 7 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 10 | 5 | < 0.1% | |
| 11 | 1 | < 0.1% | |
| 12 | 3 | < 0.1% | |
| 13 | 3 | < 0.1% | |
| 14 | 5 | < 0.1% | |
| 15 | 4 | < 0.1% | |
| 16 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 444554443 | 1 | < 0.1% | |
| 383838383 | 1 | < 0.1% | |
| 120100132 | 1 | < 0.1% | |
| 100000000 | 4 | < 0.1% | |
| 54954545 | 1 | < 0.1% | |
| 50000000 | 1 | < 0.1% | |
| 40000785 | 1 | < 0.1% | |
| 34000000 | 1 | < 0.1% | |
| 26262626 | 1 | < 0.1% | |
| 20000000 | 5 | < 0.1% |
| Distinct | 11345 |
|---|---|
| Distinct (%) | 13.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 680.0 KiB |
| 11-Nov-80 | 306 |
|---|---|
| 02-Jan-70 | 226 |
| 01-Jan-70 | 148 |
| 01-Jan-90 | 131 |
| 01-Jan-80 | 111 |
| Other values (11340) |
| Value | Count | Frequency (%) | |
| 11-Nov-80 | 306 | 0.4% | |
| 02-Jan-70 | 226 | 0.3% | |
| 01-Jan-70 | 148 | 0.2% | |
| 01-Jan-90 | 131 | 0.2% | |
| 01-Jan-80 | 111 | 0.1% | |
| 01-Jan-86 | 99 | 0.1% | |
| 01-Jan-89 | 97 | 0.1% | |
| 01-Jan-85 | 95 | 0.1% | |
| 01-Jan-88 | 92 | 0.1% | |
| 01-Jun-85 | 78 | 0.1% | |
| 01-Jun-86 | 76 | 0.1% | |
| 01-Jan-91 | 75 | 0.1% | |
| 01-Jan-87 | 75 | 0.1% | |
| 11-Nov-88 | 71 | 0.1% | |
| 01-Jan-84 | 65 | 0.1% | |
| 01-Jul-86 | 63 | 0.1% | |
| 01-Jul-89 | 61 | 0.1% | |
| 01-Jun-88 | 61 | 0.1% | |
| 05-Jun-89 | 58 | 0.1% | |
| 10-Jun-86 | 57 | 0.1% | |
| 01-Jun-87 | 57 | 0.1% | |
| 01-Jun-90 | 56 | 0.1% | |
| 01-Jun-84 | 56 | 0.1% | |
| 01-Jul-87 | 55 | 0.1% | |
| 01-Jun-89 | 55 | 0.1% | |
| Other values (11320) | 84696 | 97.3% |
Unique
| Unique | 2499 ? |
|---|---|
| Unique (%) | 2.9% |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Most occurring characters
| Value | Count | Frequency (%) | |
| - | 174040 | 22.2% | |
| 8 | 69884 | 8.9% | |
| 1 | 51171 | 6.5% | |
| 0 | 50724 | 6.5% | |
| 2 | 41823 | 5.3% | |
| 9 | 33808 | 4.3% | |
| 7 | 27680 | 3.5% | |
| J | 27066 | 3.5% | |
| u | 26380 | 3.4% | |
| a | 23457 | 3.0% | |
| 5 | 19858 | 2.5% | |
| 6 | 19256 | 2.5% | |
| 3 | 18871 | 2.4% | |
| n | 17927 | 2.3% | |
| e | 17869 | 2.3% | |
| M | 15364 | 2.0% | |
| 4 | 15005 | 1.9% | |
| A | 14213 | 1.8% | |
| r | 13263 | 1.7% | |
| c | 12522 | 1.6% | |
| p | 12364 | 1.6% | |
| l | 9139 | 1.2% | |
| y | 8907 | 1.1% | |
| g | 7407 | 0.9% | |
| O | 6306 | 0.8% | |
| Other values (8) | 48876 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 348080 | 44.4% | |
| Dash Punctuation | 174040 | 22.2% | |
| Lowercase Letter | 174040 | 22.2% | |
| Uppercase Letter | 87020 | 11.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 8 | 69884 | 20.1% | |
| 1 | 51171 | 14.7% | |
| 0 | 50724 | 14.6% | |
| 2 | 41823 | 12.0% | |
| 9 | 33808 | 9.7% | |
| 7 | 27680 | 8.0% | |
| 5 | 19858 | 5.7% | |
| 6 | 19256 | 5.5% | |
| 3 | 18871 | 5.4% | |
| 4 | 15005 | 4.3% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 174040 | 100.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| J | 27066 | 31.1% | |
| M | 15364 | 17.7% | |
| A | 14213 | 16.3% | |
| O | 6306 | 7.2% | |
| D | 6216 | 7.1% | |
| N | 6202 | 7.1% | |
| F | 6095 | 7.0% | |
| S | 5558 | 6.4% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| u | 26380 | 15.2% | |
| a | 23457 | 13.5% | |
| n | 17927 | 10.3% | |
| e | 17869 | 10.3% | |
| r | 13263 | 7.6% | |
| c | 12522 | 7.2% | |
| p | 12364 | 7.1% | |
| l | 9139 | 5.3% | |
| y | 8907 | 5.1% | |
| g | 7407 | 4.3% | |
| t | 6306 | 3.6% | |
| o | 6202 | 3.6% | |
| v | 6202 | 3.6% | |
| b | 6095 | 3.5% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 522120 | 66.7% | |
| Latin | 261060 | 33.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| - | 174040 | 33.3% | |
| 8 | 69884 | 13.4% | |
| 1 | 51171 | 9.8% | |
| 0 | 50724 | 9.7% | |
| 2 | 41823 | 8.0% | |
| 9 | 33808 | 6.5% | |
| 7 | 27680 | 5.3% | |
| 5 | 19858 | 3.8% | |
| 6 | 19256 | 3.7% | |
| 3 | 18871 | 3.6% | |
| 4 | 15005 | 2.9% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| J | 27066 | 10.4% | |
| u | 26380 | 10.1% | |
| a | 23457 | 9.0% | |
| n | 17927 | 6.9% | |
| e | 17869 | 6.8% | |
| M | 15364 | 5.9% | |
| A | 14213 | 5.4% | |
| r | 13263 | 5.1% | |
| c | 12522 | 4.8% | |
| p | 12364 | 4.7% | |
| l | 9139 | 3.5% | |
| y | 8907 | 3.4% | |
| g | 7407 | 2.8% | |
| O | 6306 | 2.4% | |
| t | 6306 | 2.4% | |
| D | 6216 | 2.4% | |
| N | 6202 | 2.4% | |
| o | 6202 | 2.4% | |
| v | 6202 | 2.4% | |
| F | 6095 | 2.3% | |
| b | 6095 | 2.3% | |
| S | 5558 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 783180 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| - | 174040 | 22.2% | |
| 8 | 69884 | 8.9% | |
| 1 | 51171 | 6.5% | |
| 0 | 50724 | 6.5% | |
| 2 | 41823 | 5.3% | |
| 9 | 33808 | 4.3% | |
| 7 | 27680 | 3.5% | |
| J | 27066 | 3.5% | |
| u | 26380 | 3.4% | |
| a | 23457 | 3.0% | |
| 5 | 19858 | 2.5% | |
| 6 | 19256 | 2.5% | |
| 3 | 18871 | 2.4% | |
| n | 17927 | 2.3% | |
| e | 17869 | 2.3% | |
| M | 15364 | 2.0% | |
| 4 | 15005 | 1.9% | |
| A | 14213 | 1.8% | |
| r | 13263 | 1.7% | |
| c | 12522 | 1.6% | |
| p | 12364 | 1.6% | |
| l | 9139 | 1.2% | |
| y | 8907 | 1.1% | |
| g | 7407 | 0.9% | |
| O | 6306 | 0.8% | |
| Other values (8) | 48876 | 6.2% |
| Distinct | 92 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 680.0 KiB |
| 03-Jul-15 | 2315 |
|---|---|
| 23-Jul-15 | 1994 |
| 30-Jul-15 | 1297 |
| 27-Jul-15 | 1292 |
| 31-Jul-15 | 1268 |
| Other values (87) |
| Value | Count | Frequency (%) | |
| 03-Jul-15 | 2315 | 2.7% | |
| 23-Jul-15 | 1994 | 2.3% | |
| 30-Jul-15 | 1297 | 1.5% | |
| 27-Jul-15 | 1292 | 1.5% | |
| 31-Jul-15 | 1268 | 1.5% | |
| 29-Jul-15 | 1236 | 1.4% | |
| 20-Jul-15 | 1231 | 1.4% | |
| 21-Jul-15 | 1201 | 1.4% | |
| 22-Jun-15 | 1201 | 1.4% | |
| 15-Jul-15 | 1193 | 1.4% | |
| 28-Jul-15 | 1191 | 1.4% | |
| 26-May-15 | 1190 | 1.4% | |
| 18-Jul-15 | 1188 | 1.4% | |
| 22-Jul-15 | 1188 | 1.4% | |
| 23-Jun-15 | 1187 | 1.4% | |
| 17-Jun-15 | 1154 | 1.3% | |
| 04-Jun-15 | 1132 | 1.3% | |
| 05-May-15 | 1128 | 1.3% | |
| 06-Jul-15 | 1126 | 1.3% | |
| 04-May-15 | 1088 | 1.3% | |
| 29-Jun-15 | 1088 | 1.3% | |
| 13-May-15 | 1081 | 1.2% | |
| 18-May-15 | 1078 | 1.2% | |
| 03-Jun-15 | 1066 | 1.2% | |
| 27-May-15 | 1064 | 1.2% | |
| Other values (67) | 55843 | 64.2% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Most occurring characters
| Value | Count | Frequency (%) | |
| - | 174040 | 22.2% | |
| 1 | 122758 | 15.7% | |
| 5 | 95420 | 12.2% | |
| J | 60059 | 7.7% | |
| u | 60059 | 7.7% | |
| 2 | 38688 | 4.9% | |
| 0 | 33903 | 4.3% | |
| l | 32996 | 4.2% | |
| n | 27063 | 3.5% | |
| M | 26961 | 3.4% | |
| a | 26961 | 3.4% | |
| y | 26961 | 3.4% | |
| 3 | 15265 | 1.9% | |
| 8 | 8936 | 1.1% | |
| 7 | 8623 | 1.1% | |
| 6 | 8616 | 1.1% | |
| 9 | 8199 | 1.0% | |
| 4 | 7672 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 348080 | 44.4% | |
| Dash Punctuation | 174040 | 22.2% | |
| Lowercase Letter | 174040 | 22.2% | |
| Uppercase Letter | 87020 | 11.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 122758 | 35.3% | |
| 5 | 95420 | 27.4% | |
| 2 | 38688 | 11.1% | |
| 0 | 33903 | 9.7% | |
| 3 | 15265 | 4.4% | |
| 8 | 8936 | 2.6% | |
| 7 | 8623 | 2.5% | |
| 6 | 8616 | 2.5% | |
| 9 | 8199 | 2.4% | |
| 4 | 7672 | 2.2% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 174040 | 100.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| J | 60059 | 69.0% | |
| M | 26961 | 31.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| u | 60059 | 34.5% | |
| l | 32996 | 19.0% | |
| n | 27063 | 15.5% | |
| a | 26961 | 15.5% | |
| y | 26961 | 15.5% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 522120 | 66.7% | |
| Latin | 261060 | 33.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| - | 174040 | 33.3% | |
| 1 | 122758 | 23.5% | |
| 5 | 95420 | 18.3% | |
| 2 | 38688 | 7.4% | |
| 0 | 33903 | 6.5% | |
| 3 | 15265 | 2.9% | |
| 8 | 8936 | 1.7% | |
| 7 | 8623 | 1.7% | |
| 6 | 8616 | 1.7% | |
| 9 | 8199 | 1.6% | |
| 4 | 7672 | 1.5% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| J | 60059 | 23.0% | |
| u | 60059 | 23.0% | |
| l | 32996 | 12.6% | |
| n | 27063 | 10.4% | |
| M | 26961 | 10.3% | |
| a | 26961 | 10.3% | |
| y | 26961 | 10.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 783180 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| - | 174040 | 22.2% | |
| 1 | 122758 | 15.7% | |
| 5 | 95420 | 12.2% | |
| J | 60059 | 7.7% | |
| u | 60059 | 7.7% | |
| 2 | 38688 | 4.9% | |
| 0 | 33903 | 4.3% | |
| l | 32996 | 4.2% | |
| n | 27063 | 3.5% | |
| M | 26961 | 3.4% | |
| a | 26961 | 3.4% | |
| y | 26961 | 3.4% | |
| 3 | 15265 | 1.9% | |
| 8 | 8936 | 1.1% | |
| 7 | 8623 | 1.1% | |
| 6 | 8616 | 1.1% | |
| 9 | 8199 | 1.0% | |
| 4 | 7672 | 1.0% |
| Distinct | 277 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 71 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 230250.6999 |
|---|---|
| Minimum | 0 |
| Maximum | 10000000 |
| Zeros | 28853 |
| Zeros (%) | 33.2% |
| Memory size | 680.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 100000 |
| Q3 | 300000 |
| 95-th percentile | 1000000 |
| Maximum | 10000000 |
| Range | 10000000 |
| Interquartile range (IQR) | 300000 |
Descriptive statistics
| Standard deviation | 354206.7595 |
|---|---|
| Coefficient of variation (CV) | 1.538352585 |
| Kurtosis | 72.20964602 |
| Mean | 230250.6999 |
| Median Absolute Deviation (MAD) | 100000 |
| Skewness | 5.64187128 |
| Sum | 2.002006811e+10 |
| Variance | 1.254624285e+11 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 28853 | 33.2% | |
| 100000 | 14311 | 16.4% | |
| 200000 | 13058 | 15.0% | |
| 300000 | 9995 | 11.5% | |
| 500000 | 9762 | 11.2% | |
| 1000000 | 4195 | 4.8% | |
| 50000 | 1245 | 1.4% | |
| 400000 | 546 | 0.6% | |
| 150000 | 540 | 0.6% | |
| 600000 | 391 | 0.4% | |
| 1500000 | 374 | 0.4% | |
| 700000 | 343 | 0.4% | |
| 800000 | 227 | 0.3% | |
| 2000000 | 215 | 0.2% | |
| 60000 | 207 | 0.2% | |
| 250000 | 192 | 0.2% | |
| 30000 | 158 | 0.2% | |
| 350000 | 144 | 0.2% | |
| 2500000 | 134 | 0.2% | |
| 70000 | 130 | 0.1% | |
| 20000 | 109 | 0.1% | |
| 1200000 | 105 | 0.1% | |
| 80000 | 89 | 0.1% | |
| 75000 | 89 | 0.1% | |
| 40000 | 88 | 0.1% | |
| Other values (252) | 1449 | 1.7% |
| Value | Count | Frequency (%) | |
| 0 | 28853 | 33.2% | |
| 2 | 1 | < 0.1% | |
| 4 | 11 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| 6 | 11 | < 0.1% | |
| 7 | 10 | < 0.1% | |
| 8 | 11 | < 0.1% | |
| 9 | 3 | < 0.1% | |
| 10 | 1 | < 0.1% | |
| 12 | 7 | < 0.1% |
| Value | Count | Frequency (%) | |
| 10000000 | 1 | < 0.1% | |
| 9999999 | 1 | < 0.1% | |
| 9000000 | 2 | < 0.1% | |
| 8000000 | 2 | < 0.1% | |
| 7000000 | 7 | < 0.1% | |
| 6500000 | 2 | < 0.1% | |
| 6000000 | 6 | < 0.1% | |
| 5500000 | 2 | < 0.1% | |
| 5000000 | 34 | < 0.1% | |
| 4800000 | 1 | < 0.1% |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 71 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.131398866 |
|---|---|
| Minimum | 0 |
| Maximum | 10 |
| Zeros | 33844 |
| Zeros (%) | 38.9% |
| Memory size | 680.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.014193118 |
|---|---|
| Coefficient of variation (CV) | 0.9450099415 |
| Kurtosis | -1.430846994 |
| Mean | 2.131398866 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.264624048 |
| Sum | 185323 |
| Variance | 4.056973915 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 33844 | 38.9% | |
| 5 | 19083 | 21.9% | |
| 3 | 13080 | 15.0% | |
| 2 | 9463 | 10.9% | |
| 4 | 6620 | 7.6% | |
| 1 | 4812 | 5.5% | |
| 10 | 40 | < 0.1% | |
| 7 | 3 | < 0.1% | |
| 6 | 2 | < 0.1% | |
| 9 | 1 | < 0.1% | |
| 8 | 1 | < 0.1% | |
| (Missing) | 71 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 33844 | 38.9% | |
| 1 | 4812 | 5.5% | |
| 2 | 9463 | 10.9% | |
| 3 | 13080 | 15.0% | |
| 4 | 6620 | 7.6% | |
| 5 | 19083 | 21.9% | |
| 6 | 2 | < 0.1% | |
| 7 | 3 | < 0.1% | |
| 8 | 1 | < 0.1% | |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 10 | 40 | < 0.1% | |
| 9 | 1 | < 0.1% | |
| 8 | 1 | < 0.1% | |
| 7 | 3 | < 0.1% | |
| 6 | 2 | < 0.1% | |
| 5 | 19083 | 21.9% | |
| 4 | 6620 | 7.6% | |
| 3 | 13080 | 15.0% | |
| 2 | 9463 | 10.9% | |
| 1 | 4812 | 5.5% |
| Distinct | 3753 |
|---|---|
| Distinct (%) | 4.3% |
| Missing | 71 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3696.227824 |
|---|---|
| Minimum | 0 |
| Maximum | 10000000 |
| Zeros | 58238 |
| Zeros (%) | 66.9% |
| Memory size | 680.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 3500 |
| 95-th percentile | 18000 |
| Maximum | 10000000 |
| Range | 10000000 |
| Interquartile range (IQR) | 3500 |
Descriptive statistics
| Standard deviation | 39810.21192 |
|---|---|
| Coefficient of variation (CV) | 10.77049733 |
| Kurtosis | 49764.8527 |
| Mean | 3696.227824 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 211.7693511 |
| Sum | 321383313.1 |
| Variance | 1584852973 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 58238 | 66.9% | |
| 5000 | 2695 | 3.1% | |
| 10000 | 1737 | 2.0% | |
| 3000 | 1581 | 1.8% | |
| 4000 | 1226 | 1.4% | |
| 2000 | 1097 | 1.3% | |
| 6000 | 837 | 1.0% | |
| 15000 | 800 | 0.9% | |
| 8000 | 786 | 0.9% | |
| 2500 | 727 | 0.8% | |
| 7000 | 718 | 0.8% | |
| 3500 | 601 | 0.7% | |
| 20000 | 521 | 0.6% | |
| 12000 | 480 | 0.6% | |
| 9000 | 361 | 0.4% | |
| 4500 | 340 | 0.4% | |
| 1000 | 323 | 0.4% | |
| 1500 | 323 | 0.4% | |
| 25000 | 282 | 0.3% | |
| 11000 | 277 | 0.3% | |
| 7500 | 255 | 0.3% | |
| 30000 | 241 | 0.3% | |
| 14000 | 217 | 0.2% | |
| 13000 | 209 | 0.2% | |
| 5500 | 203 | 0.2% | |
| Other values (3728) | 11874 | 13.6% |
| Value | Count | Frequency (%) | |
| 0 | 58238 | 66.9% | |
| 1 | 43 | < 0.1% | |
| 1.5 | 1 | < 0.1% | |
| 2 | 13 | < 0.1% | |
| 3 | 7 | < 0.1% | |
| 3.5 | 2 | < 0.1% | |
| 4 | 5 | < 0.1% | |
| 4.5 | 1 | < 0.1% | |
| 5 | 2 | < 0.1% | |
| 6 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 10000000 | 1 | < 0.1% | |
| 5454365 | 1 | < 0.1% | |
| 626266 | 1 | < 0.1% | |
| 420000 | 2 | < 0.1% | |
| 300000 | 2 | < 0.1% | |
| 273000 | 1 | < 0.1% | |
| 250000 | 1 | < 0.1% | |
| 225000 | 1 | < 0.1% | |
| 200000 | 4 | < 0.1% | |
| 185000 | 1 | < 0.1% |
| Distinct | 43567 |
|---|---|
| Distinct (%) | 50.1% |
| Missing | 71 |
| Missing (%) | 0.1% |
| Memory size | 680.0 KiB |
| 0 | 4914 |
|---|---|
| TATA CONSULTANCY SERVICES LTD (TCS) | 550 |
| COGNIZANT TECHNOLOGY SOLUTIONS INDIA PVT LTD | 404 |
| ACCENTURE SERVICES PVT LTD | 324 |
| 301 | |
| Other values (43562) |
| Value | Count | Frequency (%) | |
| 0 | 4914 | 5.6% | |
| TATA CONSULTANCY SERVICES LTD (TCS) | 550 | 0.6% | |
| COGNIZANT TECHNOLOGY SOLUTIONS INDIA PVT LTD | 404 | 0.5% | |
| ACCENTURE SERVICES PVT LTD | 324 | 0.4% | |
| 301 | 0.3% | ||
| HCL TECHNOLOGIES LTD | 250 | 0.3% | |
| ICICI BANK LTD | 239 | 0.3% | |
| INDIAN AIR FORCE | 191 | 0.2% | |
| INFOSYS TECHNOLOGIES | 181 | 0.2% | |
| GENPACT | 179 | 0.2% | |
| IBM CORPORATION | 173 | 0.2% | |
| INDIAN ARMY | 171 | 0.2% | |
| TYPE SLOWLY FOR AUTO FILL | 162 | 0.2% | |
| WIPRO TECHNOLOGIES | 155 | 0.2% | |
| HDFC BANK LTD | 148 | 0.2% | |
| IKYA HUMAN CAPITAL SOLUTIONS LTD | 142 | 0.2% | |
| STATE GOVERNMENT | 134 | 0.2% | |
| INDIAN RAILWAY | 130 | 0.1% | |
| INDIAN NAVY | 128 | 0.1% | |
| ARMY | 126 | 0.1% | |
| WIPRO BPO | 116 | 0.1% | |
| OTHERS | 115 | 0.1% | |
| CONVERGYS INDIA SERVICES PVT LTD | 113 | 0.1% | |
| TECH MAHINDRA LTD | 113 | 0.1% | |
| SERCO BPO PVT LTD | 108 | 0.1% | |
| Other values (43542) | 77382 | 88.9% |
Unique
| Unique | 33451 ? |
|---|---|
| Unique (%) | 38.5% |
Length
| Max length | 103 |
|---|---|
| Median length | 20 |
| Mean length | 20.54652953 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 199386 | 11.2% | ||
| T | 154518 | 8.6% | |
| A | 148006 | 8.3% | |
| I | 130492 | 7.3% | |
| E | 127643 | 7.1% | |
| N | 110804 | 6.2% | |
| S | 104447 | 5.8% | |
| L | 103699 | 5.8% | |
| R | 91365 | 5.1% | |
| O | 88703 | 5.0% | |
| D | 87269 | 4.9% | |
| C | 66968 | 3.7% | |
| P | 56940 | 3.2% | |
| V | 46249 | 2.6% | |
| H | 42447 | 2.4% | |
| M | 40996 | 2.3% | |
| U | 37776 | 2.1% | |
| G | 31479 | 1.8% | |
| B | 19826 | 1.1% | |
| Y | 19584 | 1.1% | |
| F | 17854 | 1.0% | |
| K | 16933 | 0.9% | |
| W | 9185 | 0.5% | |
| J | 7426 | 0.4% | |
| . | 6504 | 0.4% | |
| Other values (88) | 21460 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 1568014 | 87.7% | |
| Space Separator | 199395 | 11.2% | |
| Other Punctuation | 8874 | 0.5% | |
| Decimal Number | 6304 | 0.4% | |
| Open Punctuation | 1613 | 0.1% | |
| Close Punctuation | 1599 | 0.1% | |
| Lowercase Letter | 1398 | 0.1% | |
| Dash Punctuation | 682 | < 0.1% | |
| Currency Symbol | 36 | < 0.1% | |
| Control | 14 | < 0.1% | |
| Modifier Symbol | 7 | < 0.1% | |
| Other Symbol | 7 | < 0.1% | |
| Connector Punctuation | 5 | < 0.1% | |
| Other Number | 5 | < 0.1% | |
| Math Symbol | 4 | < 0.1% | |
| Other Letter | 2 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| T | 154518 | 9.9% | |
| A | 148006 | 9.4% | |
| I | 130492 | 8.3% | |
| E | 127643 | 8.1% | |
| N | 110804 | 7.1% | |
| S | 104447 | 6.7% | |
| L | 103699 | 6.6% | |
| R | 91365 | 5.8% | |
| O | 88703 | 5.7% | |
| D | 87269 | 5.6% | |
| C | 66968 | 4.3% | |
| P | 56940 | 3.6% | |
| V | 46249 | 2.9% | |
| H | 42447 | 2.7% | |
| M | 40996 | 2.6% | |
| U | 37776 | 2.4% | |
| G | 31479 | 2.0% | |
| B | 19826 | 1.3% | |
| Y | 19584 | 1.2% | |
| F | 17854 | 1.1% | |
| K | 16933 | 1.1% | |
| W | 9185 | 0.6% | |
| J | 7426 | 0.5% | |
| X | 4007 | 0.3% | |
| Z | 2464 | 0.2% | |
| Other values (6) | 934 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 199386 | > 99.9% | ||
| 9 | < 0.1% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 1603 | 99.4% | |
| [ | 8 | 0.5% | |
| { | 2 | 0.1% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 1590 | 99.4% | |
| ] | 7 | 0.4% | |
| } | 2 | 0.1% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| . | 6504 | 73.3% | |
| & | 1703 | 19.2% | |
| , | 437 | 4.9% | |
| / | 190 | 2.1% | |
| @ | 17 | 0.2% | |
| ; | 13 | 0.1% | |
| : | 4 | < 0.1% | |
| ? | 2 | < 0.1% | |
| ¿ | 2 | < 0.1% | |
| " | 1 | < 0.1% | |
| # | 1 | < 0.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 5094 | 80.8% | |
| 2 | 259 | 4.1% | |
| 4 | 221 | 3.5% | |
| 3 | 194 | 3.1% | |
| 1 | 163 | 2.6% | |
| 7 | 130 | 2.1% | |
| 9 | 73 | 1.2% | |
| 5 | 64 | 1.0% | |
| 8 | 57 | 0.9% | |
| 6 | 49 | 0.8% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 682 | 100.0% |
Most frequent Modifier Symbol characters
| Value | Count | Frequency (%) | |
| ¸ | 4 | 57.1% | |
| ¨ | 1 | 14.3% | |
| ` | 1 | 14.3% | |
| ¯ | 1 | 14.3% |
Most frequent Control characters
| Value | Count | Frequency (%) | |
| | 4 | 28.6% | |
| | 2 | 14.3% | |
| 2 | 14.3% | ||
| | 1 | 7.1% | |
| | 1 | 7.1% | |
| | 1 | 7.1% | |
| | 1 | 7.1% | |
| | 1 | 7.1% | |
| | 1 | 7.1% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 5 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 270 | 19.3% | |
| t | 179 | 12.8% | |
| e | 169 | 12.1% | |
| a | 156 | 11.2% | |
| r | 112 | 8.0% | |
| o | 103 | 7.4% | |
| i | 58 | 4.1% | |
| l | 47 | 3.4% | |
| m | 47 | 3.4% | |
| v | 45 | 3.2% | |
| h | 44 | 3.1% | |
| d | 42 | 3.0% | |
| s | 29 | 2.1% | |
| p | 15 | 1.1% | |
| y | 14 | 1.0% | |
| u | 13 | 0.9% | |
| c | 13 | 0.9% | |
| g | 12 | 0.9% | |
| f | 8 | 0.6% | |
| b | 8 | 0.6% | |
| w | 4 | 0.3% | |
| k | 4 | 0.3% | |
| z | 2 | 0.1% | |
| x | 2 | 0.1% | |
| µ | 1 | 0.1% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| > | 3 | 75.0% | |
| ¬ | 1 | 25.0% |
Most frequent Currency Symbol characters
| Value | Count | Frequency (%) | |
| ¤ | 28 | 77.8% | |
| ¥ | 7 | 19.4% | |
| $ | 1 | 2.8% |
Most frequent Other Number characters
| Value | Count | Frequency (%) | |
| ¾ | 4 | 80.0% | |
| ¹ | 1 | 20.0% |
Most frequent Other Symbol characters
| Value | Count | Frequency (%) | |
| ° | 3 | 42.9% | |
| ® | 2 | 28.6% | |
| ¦ | 2 | 28.6% |
Most frequent Other Letter characters
| Value | Count | Frequency (%) | |
| ª | 1 | 50.0% | |
| º | 1 | 50.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1569413 | 87.8% | |
| Common | 218546 | 12.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| T | 154518 | 9.8% | |
| A | 148006 | 9.4% | |
| I | 130492 | 8.3% | |
| E | 127643 | 8.1% | |
| N | 110804 | 7.1% | |
| S | 104447 | 6.7% | |
| L | 103699 | 6.6% | |
| R | 91365 | 5.8% | |
| O | 88703 | 5.7% | |
| D | 87269 | 5.6% | |
| C | 66968 | 4.3% | |
| P | 56940 | 3.6% | |
| V | 46249 | 2.9% | |
| H | 42447 | 2.7% | |
| M | 40996 | 2.6% | |
| U | 37776 | 2.4% | |
| G | 31479 | 2.0% | |
| B | 19826 | 1.3% | |
| Y | 19584 | 1.2% | |
| F | 17854 | 1.1% | |
| K | 16933 | 1.1% | |
| W | 9185 | 0.6% | |
| J | 7426 | 0.5% | |
| X | 4007 | 0.3% | |
| Z | 2464 | 0.2% | |
| Other values (33) | 2333 | 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 199386 | 91.2% | ||
| . | 6504 | 3.0% | |
| 0 | 5094 | 2.3% | |
| & | 1703 | 0.8% | |
| ( | 1603 | 0.7% | |
| ) | 1590 | 0.7% | |
| - | 682 | 0.3% | |
| , | 437 | 0.2% | |
| 2 | 259 | 0.1% | |
| 4 | 221 | 0.1% | |
| 3 | 194 | 0.1% | |
| / | 190 | 0.1% | |
| 1 | 163 | 0.1% | |
| 7 | 130 | 0.1% | |
| 9 | 73 | < 0.1% | |
| 5 | 64 | < 0.1% | |
| 8 | 57 | < 0.1% | |
| 6 | 49 | < 0.1% | |
| ¤ | 28 | < 0.1% | |
| @ | 17 | < 0.1% | |
| ; | 13 | < 0.1% | |
| 9 | < 0.1% | ||
| [ | 8 | < 0.1% | |
| ] | 7 | < 0.1% | |
| ¥ | 7 | < 0.1% | |
| Other values (30) | 58 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1787833 | > 99.9% | |
| None | 126 | < 0.1% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 199386 | 11.2% | ||
| T | 154518 | 8.6% | |
| A | 148006 | 8.3% | |
| I | 130492 | 7.3% | |
| E | 127643 | 7.1% | |
| N | 110804 | 6.2% | |
| S | 104447 | 5.8% | |
| L | 103699 | 5.8% | |
| R | 91365 | 5.1% | |
| O | 88703 | 5.0% | |
| D | 87269 | 4.9% | |
| C | 66968 | 3.7% | |
| P | 56940 | 3.2% | |
| V | 46249 | 2.6% | |
| H | 42447 | 2.4% | |
| M | 40996 | 2.3% | |
| U | 37776 | 2.1% | |
| G | 31479 | 1.8% | |
| B | 19826 | 1.1% | |
| Y | 19584 | 1.1% | |
| F | 17854 | 1.0% | |
| K | 16933 | 0.9% | |
| W | 9185 | 0.5% | |
| J | 7426 | 0.4% | |
| . | 6504 | 0.4% | |
| Other values (59) | 21334 | 1.2% |
Most frequent None characters
| Value | Count | Frequency (%) | |
| À | 33 | 26.2% | |
| ¤ | 28 | 22.2% | |
| Â | 10 | 7.9% | |
| 9 | 7.1% | ||
| ¥ | 7 | 5.6% | |
| ¸ | 4 | 3.2% | |
| ¾ | 4 | 3.2% | |
| | 4 | 3.2% | |
| ° | 3 | 2.4% | |
| | 2 | 1.6% | |
| ® | 2 | 1.6% | |
| ¦ | 2 | 1.6% | |
| ¿ | 2 | 1.6% | |
| É | 1 | 0.8% | |
| ¨ | 1 | 0.8% | |
| Ê | 1 | 0.8% | |
| | 1 | 0.8% | |
| | 1 | 0.8% | |
| | 1 | 0.8% | |
| µ | 1 | 0.8% | |
| ¹ | 1 | 0.8% | |
| | 1 | 0.8% | |
| | 1 | 0.8% | |
| ¯ | 1 | 0.8% | |
| ª | 1 | 0.8% | |
| Other values (4) | 4 | 3.2% |
| Distinct | 57 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 11764 |
| Missing (%) | 13.5% |
| Memory size | 680.0 KiB |
| HDFC Bank | |
|---|---|
| ICICI Bank | |
| State Bank of India | |
| Axis Bank | |
| Citibank | |
| Other values (52) |
| Value | Count | Frequency (%) | |
| HDFC Bank | 17695 | 20.3% | |
| ICICI Bank | 13636 | 15.7% | |
| State Bank of India | 11843 | 13.6% | |
| Axis Bank | 8783 | 10.1% | |
| Citibank | 2376 | 2.7% | |
| Kotak Bank | 2067 | 2.4% | |
| IDBI Bank | 1550 | 1.8% | |
| Punjab National Bank | 1201 | 1.4% | |
| Bank of India | 1170 | 1.3% | |
| Bank of Baroda | 1126 | 1.3% | |
| Standard Chartered Bank | 995 | 1.1% | |
| Canara Bank | 990 | 1.1% | |
| Union Bank of India | 951 | 1.1% | |
| Yes Bank | 779 | 0.9% | |
| ING Vysya | 678 | 0.8% | |
| Corporation bank | 649 | 0.7% | |
| Indian Overseas Bank | 612 | 0.7% | |
| State Bank of Hyderabad | 597 | 0.7% | |
| Indian Bank | 555 | 0.6% | |
| Oriental Bank of Commerce | 524 | 0.6% | |
| IndusInd Bank | 503 | 0.6% | |
| Andhra Bank | 485 | 0.6% | |
| Central Bank of India | 445 | 0.5% | |
| Syndicate Bank | 415 | 0.5% | |
| Bank of Maharasthra | 406 | 0.5% | |
| Other values (32) | 4225 | 4.9% | |
| (Missing) | 11764 | 13.5% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 47 |
|---|---|
| Median length | 9 |
| Mean length | 11.09836819 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 142159 | 14.7% | |
| n | 126350 | 13.1% | |
| 111132 | 11.5% | ||
| k | 76996 | 8.0% | |
| B | 74436 | 7.7% | |
| I | 61650 | 6.4% | |
| C | 51645 | 5.3% | |
| t | 38441 | 4.0% | |
| i | 34916 | 3.6% | |
| o | 26974 | 2.8% | |
| d | 24339 | 2.5% | |
| e | 22670 | 2.3% | |
| D | 19594 | 2.0% | |
| H | 18628 | 1.9% | |
| f | 18260 | 1.9% | |
| F | 17963 | 1.9% | |
| S | 15621 | 1.6% | |
| s | 13457 | 1.4% | |
| r | 13289 | 1.4% | |
| A | 9616 | 1.0% | |
| x | 8783 | 0.9% | |
| b | 5359 | 0.6% | |
| y | 3757 | 0.4% | |
| l | 3354 | 0.3% | |
| h | 3161 | 0.3% | |
| Other values (23) | 23230 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 571445 | 59.2% | |
| Uppercase Letter | 282639 | 29.3% | |
| Space Separator | 111132 | 11.5% | |
| Other Punctuation | 456 | < 0.1% | |
| Dash Punctuation | 108 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| B | 74436 | 26.3% | |
| I | 61650 | 21.8% | |
| C | 51645 | 18.3% | |
| D | 19594 | 6.9% | |
| H | 18628 | 6.6% | |
| F | 17963 | 6.4% | |
| S | 15621 | 5.5% | |
| A | 9616 | 3.4% | |
| K | 2656 | 0.9% | |
| N | 1958 | 0.7% | |
| P | 1460 | 0.5% | |
| O | 1375 | 0.5% | |
| U | 1371 | 0.5% | |
| V | 1306 | 0.5% | |
| Y | 779 | 0.3% | |
| M | 732 | 0.3% | |
| G | 690 | 0.2% | |
| J | 390 | 0.1% | |
| T | 381 | 0.1% | |
| L | 300 | 0.1% | |
| R | 88 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 111132 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 142159 | 24.9% | |
| n | 126350 | 22.1% | |
| k | 76996 | 13.5% | |
| t | 38441 | 6.7% | |
| i | 34916 | 6.1% | |
| o | 26974 | 4.7% | |
| d | 24339 | 4.3% | |
| e | 22670 | 4.0% | |
| f | 18260 | 3.2% | |
| s | 13457 | 2.4% | |
| r | 13289 | 2.3% | |
| x | 8783 | 1.5% | |
| b | 5359 | 0.9% | |
| y | 3757 | 0.7% | |
| l | 3354 | 0.6% | |
| h | 3161 | 0.6% | |
| u | 2912 | 0.5% | |
| j | 1524 | 0.3% | |
| c | 1386 | 0.2% | |
| m | 1228 | 0.2% | |
| p | 1088 | 0.2% | |
| v | 839 | 0.1% | |
| w | 195 | < 0.1% | |
| g | 8 | < 0.1% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| & | 456 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 108 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 854084 | 88.4% | |
| Common | 111696 | 11.6% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 142159 | 16.6% | |
| n | 126350 | 14.8% | |
| k | 76996 | 9.0% | |
| B | 74436 | 8.7% | |
| I | 61650 | 7.2% | |
| C | 51645 | 6.0% | |
| t | 38441 | 4.5% | |
| i | 34916 | 4.1% | |
| o | 26974 | 3.2% | |
| d | 24339 | 2.8% | |
| e | 22670 | 2.7% | |
| D | 19594 | 2.3% | |
| H | 18628 | 2.2% | |
| f | 18260 | 2.1% | |
| F | 17963 | 2.1% | |
| S | 15621 | 1.8% | |
| s | 13457 | 1.6% | |
| r | 13289 | 1.6% | |
| A | 9616 | 1.1% | |
| x | 8783 | 1.0% | |
| b | 5359 | 0.6% | |
| y | 3757 | 0.4% | |
| l | 3354 | 0.4% | |
| h | 3161 | 0.4% | |
| u | 2912 | 0.3% | |
| Other values (20) | 19754 | 2.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 111132 | 99.5% | ||
| & | 456 | 0.4% | |
| - | 108 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 965780 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 142159 | 14.7% | |
| n | 126350 | 13.1% | |
| 111132 | 11.5% | ||
| k | 76996 | 8.0% | |
| B | 74436 | 7.7% | |
| I | 61650 | 6.4% | |
| C | 51645 | 5.3% | |
| t | 38441 | 4.0% | |
| i | 34916 | 3.6% | |
| o | 26974 | 2.8% | |
| d | 24339 | 2.5% | |
| e | 22670 | 2.3% | |
| D | 19594 | 2.0% | |
| H | 18628 | 1.9% | |
| f | 18260 | 1.9% | |
| F | 17963 | 1.9% | |
| S | 15621 | 1.6% | |
| s | 13457 | 1.4% | |
| r | 13289 | 1.4% | |
| A | 9616 | 1.0% | |
| x | 8783 | 0.9% | |
| b | 5359 | 0.6% | |
| y | 3757 | 0.4% | |
| l | 3354 | 0.3% | |
| h | 3161 | 0.3% | |
| Other values (23) | 23230 | 2.4% |
Mobile_Verified
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 680.0 KiB |
| Y | |
|---|---|
| N |
| Value | Count | Frequency (%) | |
| Y | 56481 | 64.9% | |
| N | 30539 | 35.1% |
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.961503103 |
|---|---|
| Minimum | 0 |
| Maximum | 18 |
| Zeros | 29087 |
| Zeros (%) | 33.4% |
| Memory size | 680.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 11 |
| 95-th percentile | 15 |
| Maximum | 18 |
| Range | 18 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 5.670384977 |
|---|---|
| Coefficient of variation (CV) | 1.142876435 |
| Kurtosis | -0.987668742 |
| Mean | 4.961503103 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.7606063211 |
| Sum | 431750 |
| Variance | 32.15326579 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 29087 | 33.4% | |
| 1 | 12236 | 14.1% | |
| 3 | 6759 | 7.8% | |
| 11 | 5204 | 6.0% | |
| 2 | 4485 | 5.2% | |
| 14 | 3662 | 4.2% | |
| 15 | 3509 | 4.0% | |
| 12 | 2989 | 3.4% | |
| 13 | 2622 | 3.0% | |
| 8 | 2515 | 2.9% | |
| 10 | 2427 | 2.8% | |
| 9 | 2281 | 2.6% | |
| 16 | 2097 | 2.4% | |
| 4 | 1815 | 2.1% | |
| 17 | 1691 | 1.9% | |
| 7 | 1489 | 1.7% | |
| 6 | 983 | 1.1% | |
| 5 | 975 | 1.1% | |
| 18 | 194 | 0.2% |
| Value | Count | Frequency (%) | |
| 0 | 29087 | 33.4% | |
| 1 | 12236 | 14.1% | |
| 2 | 4485 | 5.2% | |
| 3 | 6759 | 7.8% | |
| 4 | 1815 | 2.1% | |
| 5 | 975 | 1.1% | |
| 6 | 983 | 1.1% | |
| 7 | 1489 | 1.7% | |
| 8 | 2515 | 2.9% | |
| 9 | 2281 | 2.6% |
| Value | Count | Frequency (%) | |
| 18 | 194 | 0.2% | |
| 17 | 1691 | 1.9% | |
| 16 | 2097 | 2.4% | |
| 15 | 3509 | 4.0% | |
| 14 | 3662 | 4.2% | |
| 13 | 2622 | 3.0% | |
| 12 | 2989 | 3.4% | |
| 11 | 5204 | 6.0% | |
| 10 | 2427 | 2.8% | |
| 9 | 2281 | 2.6% |
Var1
Categorical
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 680.0 KiB |
| HBXX | |
|---|---|
| HBXC | |
| HBXB | 4479 |
| HAXA | 2909 |
| HBXA | 2123 |
| Other values (14) |
| Value | Count | Frequency (%) | |
| HBXX | 59294 | 68.1% | |
| HBXC | 9010 | 10.4% | |
| HBXB | 4479 | 5.1% | |
| HAXA | 2909 | 3.3% | |
| HBXA | 2123 | 2.4% | |
| HAXB | 2011 | 2.3% | |
| HBXD | 1964 | 2.3% | |
| HAXC | 1536 | 1.8% | |
| HBXH | 970 | 1.1% | |
| HCXF | 722 | 0.8% | |
| HAYT | 508 | 0.6% | |
| HAVC | 384 | 0.4% | |
| HAXM | 268 | 0.3% | |
| HCXD | 237 | 0.3% | |
| HCYS | 217 | 0.2% | |
| HVYS | 186 | 0.2% | |
| HAZD | 109 | 0.1% | |
| HCXG | 78 | 0.1% | |
| HAXF | 15 | < 0.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Most occurring characters
| Value | Count | Frequency (%) | |
| X | 144910 | 41.6% | |
| H | 87990 | 25.3% | |
| B | 84330 | 24.2% | |
| A | 12772 | 3.7% | |
| C | 12184 | 3.5% | |
| D | 2310 | 0.7% | |
| Y | 911 | 0.3% | |
| F | 737 | 0.2% | |
| V | 570 | 0.2% | |
| T | 508 | 0.1% | |
| S | 403 | 0.1% | |
| M | 268 | 0.1% | |
| Z | 109 | < 0.1% | |
| G | 78 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 348080 | 100.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| X | 144910 | 41.6% | |
| H | 87990 | 25.3% | |
| B | 84330 | 24.2% | |
| A | 12772 | 3.7% | |
| C | 12184 | 3.5% | |
| D | 2310 | 0.7% | |
| Y | 911 | 0.3% | |
| F | 737 | 0.2% | |
| V | 570 | 0.2% | |
| T | 508 | 0.1% | |
| S | 403 | 0.1% | |
| M | 268 | 0.1% | |
| Z | 109 | < 0.1% | |
| G | 78 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 348080 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| X | 144910 | 41.6% | |
| H | 87990 | 25.3% | |
| B | 84330 | 24.2% | |
| A | 12772 | 3.7% | |
| C | 12184 | 3.5% | |
| D | 2310 | 0.7% | |
| Y | 911 | 0.3% | |
| F | 737 | 0.2% | |
| V | 570 | 0.2% | |
| T | 508 | 0.1% | |
| S | 403 | 0.1% | |
| M | 268 | 0.1% | |
| Z | 109 | < 0.1% | |
| G | 78 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 348080 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| X | 144910 | 41.6% | |
| H | 87990 | 25.3% | |
| B | 84330 | 24.2% | |
| A | 12772 | 3.7% | |
| C | 12184 | 3.5% | |
| D | 2310 | 0.7% | |
| Y | 911 | 0.3% | |
| F | 737 | 0.2% | |
| V | 570 | 0.2% | |
| T | 508 | 0.1% | |
| S | 403 | 0.1% | |
| M | 268 | 0.1% | |
| Z | 109 | < 0.1% | |
| G | 78 | < 0.1% |
| Distinct | 203 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 34613 |
| Missing (%) | 39.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 395010.5902 |
|---|---|
| Minimum | 50000 |
| Maximum | 3000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 680.0 KiB |
Quantile statistics
| Minimum | 50000 |
|---|---|
| 5-th percentile | 100000 |
| Q1 | 200000 |
| median | 300000 |
| Q3 | 500000 |
| 95-th percentile | 1000000 |
| Maximum | 3000000 |
| Range | 2950000 |
| Interquartile range (IQR) | 300000 |
Descriptive statistics
| Standard deviation | 308248.1363 |
|---|---|
| Coefficient of variation (CV) | 0.7803541067 |
| Kurtosis | 6.489087539 |
| Mean | 395010.5902 |
| Median Absolute Deviation (MAD) | 150000 |
| Skewness | 2.104983545 |
| Sum | 2.070132e+10 |
| Variance | 9.50169135e+10 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 100000 | 6884 | 7.9% | |
| 200000 | 6583 | 7.6% | |
| 300000 | 5385 | 6.2% | |
| 500000 | 4849 | 5.6% | |
| 1000000 | 1644 | 1.9% | |
| 400000 | 1229 | 1.4% | |
| 290000 | 1039 | 1.2% | |
| 350000 | 820 | 0.9% | |
| 360000 | 816 | 0.9% | |
| 420000 | 774 | 0.9% | |
| 340000 | 738 | 0.8% | |
| 150000 | 737 | 0.8% | |
| 330000 | 734 | 0.8% | |
| 450000 | 728 | 0.8% | |
| 320000 | 655 | 0.8% | |
| 1500000 | 652 | 0.7% | |
| 390000 | 601 | 0.7% | |
| 240000 | 580 | 0.7% | |
| 1200000 | 503 | 0.6% | |
| 220000 | 491 | 0.6% | |
| 190000 | 489 | 0.6% | |
| 250000 | 489 | 0.6% | |
| 370000 | 467 | 0.5% | |
| 380000 | 420 | 0.5% | |
| 600000 | 403 | 0.5% | |
| Other values (178) | 13697 | 15.7% | |
| (Missing) | 34613 | 39.8% |
| Value | Count | Frequency (%) | |
| 50000 | 352 | 0.4% | |
| 60000 | 199 | 0.2% | |
| 70000 | 229 | 0.3% | |
| 80000 | 184 | 0.2% | |
| 90000 | 165 | 0.2% | |
| 100000 | 6884 | 7.9% | |
| 110000 | 150 | 0.2% | |
| 120000 | 230 | 0.3% | |
| 130000 | 215 | 0.2% | |
| 140000 | 172 | 0.2% |
| Value | Count | Frequency (%) | |
| 3000000 | 7 | < 0.1% | |
| 2880000 | 1 | < 0.1% | |
| 2640000 | 1 | < 0.1% | |
| 2570000 | 1 | < 0.1% | |
| 2500000 | 47 | 0.1% | |
| 2480000 | 1 | < 0.1% | |
| 2470000 | 1 | < 0.1% | |
| 2460000 | 1 | < 0.1% | |
| 2410000 | 1 | < 0.1% | |
| 2400000 | 1 | < 0.1% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 34613 |
| Missing (%) | 39.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.891369474 |
|---|---|
| Minimum | 1 |
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 680.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.16535892 |
|---|---|
| Coefficient of variation (CV) | 0.2994726993 |
| Kurtosis | -0.2376391343 |
| Mean | 3.891369474 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.8433232334 |
| Sum | 203935 |
| Variance | 1.358061413 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 5 | 20765 | 23.9% | |
| 4 | 15135 | 17.4% | |
| 3 | 8858 | 10.2% | |
| 2 | 5332 | 6.1% | |
| 1 | 2314 | 2.7% | |
| 6 | 3 | < 0.1% | |
| (Missing) | 34613 | 39.8% |
| Value | Count | Frequency (%) | |
| 1 | 2314 | 2.7% | |
| 2 | 5332 | 6.1% | |
| 3 | 8858 | 10.2% | |
| 4 | 15135 | 17.4% | |
| 5 | 20765 | 23.9% | |
| 6 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 6 | 3 | < 0.1% | |
| 5 | 20765 | 23.9% | |
| 4 | 15135 | 17.4% | |
| 3 | 8858 | 10.2% | |
| 2 | 5332 | 6.1% | |
| 1 | 2314 | 2.7% |
| Distinct | 73 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 59294 |
| Missing (%) | 68.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.19747421 |
|---|---|
| Minimum | 11.99 |
| Maximum | 37 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 680.0 KiB |
Quantile statistics
| Minimum | 11.99 |
|---|---|
| 5-th percentile | 13.5 |
| Q1 | 15.25 |
| median | 18 |
| Q3 | 20 |
| 95-th percentile | 31.5 |
| Maximum | 37 |
| Range | 25.01 |
| Interquartile range (IQR) | 4.75 |
Descriptive statistics
| Standard deviation | 5.834213258 |
|---|---|
| Coefficient of variation (CV) | 0.3039052531 |
| Kurtosis | 1.14015201 |
| Mean | 19.19747421 |
| Median Absolute Deviation (MAD) | 2.5 |
| Skewness | 1.430301188 |
| Sum | 532269.17 |
| Variance | 34.03804434 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 20 | 4707 | 5.4% | |
| 14.85 | 2016 | 2.3% | |
| 13.99 | 1699 | 2.0% | |
| 31.5 | 1696 | 1.9% | |
| 15.25 | 1553 | 1.8% | |
| 16.75 | 1518 | 1.7% | |
| 18.25 | 1312 | 1.5% | |
| 15.5 | 1292 | 1.5% | |
| 28.5 | 950 | 1.1% | |
| 18.4 | 800 | 0.9% | |
| 13 | 660 | 0.8% | |
| 24 | 649 | 0.7% | |
| 19 | 625 | 0.7% | |
| 15.75 | 557 | 0.6% | |
| 13.5 | 521 | 0.6% | |
| 18.15 | 506 | 0.6% | |
| 35.5 | 493 | 0.6% | |
| 18 | 474 | 0.5% | |
| 17 | 416 | 0.5% | |
| 16.25 | 370 | 0.4% | |
| 17.5 | 359 | 0.4% | |
| 18.5 | 315 | 0.4% | |
| 37 | 302 | 0.3% | |
| 14.49 | 292 | 0.3% | |
| 13.49 | 275 | 0.3% | |
| Other values (48) | 3369 | 3.9% | |
| (Missing) | 59294 | 68.1% |
| Value | Count | Frequency (%) | |
| 11.99 | 90 | 0.1% | |
| 12.99 | 191 | 0.2% | |
| 13 | 660 | 0.8% | |
| 13.25 | 87 | 0.1% | |
| 13.49 | 275 | 0.3% | |
| 13.5 | 521 | 0.6% | |
| 13.75 | 255 | 0.3% | |
| 13.99 | 1699 | 2.0% | |
| 14 | 4 | < 0.1% | |
| 14.25 | 262 | 0.3% |
| Value | Count | Frequency (%) | |
| 37 | 302 | 0.3% | |
| 35.5 | 493 | 0.6% | |
| 33 | 158 | 0.2% | |
| 32.5 | 212 | 0.2% | |
| 31.5 | 1696 | 1.9% | |
| 31 | 56 | 0.1% | |
| 30.5 | 13 | < 0.1% | |
| 29.5 | 26 | < 0.1% | |
| 29 | 46 | 0.1% | |
| 28.5 | 950 | 1.1% |
| Distinct | 571 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 59600 |
| Missing (%) | 68.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5131.150839 |
|---|---|
| Minimum | 200 |
| Maximum | 50000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 680.0 KiB |
Quantile statistics
| Minimum | 200 |
|---|---|
| 5-th percentile | 1000 |
| Q1 | 2000 |
| median | 4000 |
| Q3 | 6250 |
| 95-th percentile | 14000 |
| Maximum | 50000 |
| Range | 49800 |
| Interquartile range (IQR) | 4250 |
Descriptive statistics
| Standard deviation | 4725.837644 |
|---|---|
| Coefficient of variation (CV) | 0.9210093003 |
| Kurtosis | 10.58856672 |
| Mean | 5131.150839 |
| Median Absolute Deviation (MAD) | 2000 |
| Skewness | 2.680108856 |
| Sum | 140696156 |
| Variance | 22333541.44 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 2000 | 3074 | 3.5% | |
| 1000 | 2067 | 2.4% | |
| 4000 | 2006 | 2.3% | |
| 3000 | 1286 | 1.5% | |
| 6000 | 1183 | 1.4% | |
| 10000 | 1093 | 1.3% | |
| 1500 | 641 | 0.7% | |
| 5000 | 584 | 0.7% | |
| 2500 | 552 | 0.6% | |
| 4500 | 468 | 0.5% | |
| 3800 | 319 | 0.4% | |
| 2900 | 317 | 0.4% | |
| 3600 | 296 | 0.3% | |
| 4200 | 287 | 0.3% | |
| 4400 | 282 | 0.3% | |
| 3300 | 276 | 0.3% | |
| 3200 | 267 | 0.3% | |
| 3500 | 264 | 0.3% | |
| 1600 | 256 | 0.3% | |
| 8000 | 248 | 0.3% | |
| 4800 | 241 | 0.3% | |
| 6800 | 241 | 0.3% | |
| 2400 | 231 | 0.3% | |
| 7500 | 228 | 0.3% | |
| 5800 | 223 | 0.3% | |
| Other values (546) | 10490 | 12.1% | |
| (Missing) | 59600 | 68.5% |
| Value | Count | Frequency (%) | |
| 200 | 1 | < 0.1% | |
| 250 | 19 | < 0.1% | |
| 300 | 6 | < 0.1% | |
| 325 | 1 | < 0.1% | |
| 350 | 7 | < 0.1% | |
| 375 | 2 | < 0.1% | |
| 400 | 15 | < 0.1% | |
| 450 | 3 | < 0.1% | |
| 480 | 1 | < 0.1% | |
| 500 | 196 | 0.2% |
| Value | Count | Frequency (%) | |
| 50000 | 3 | < 0.1% | |
| 45400 | 1 | < 0.1% | |
| 40000 | 15 | < 0.1% | |
| 38600 | 1 | < 0.1% | |
| 37600 | 1 | < 0.1% | |
| 37500 | 6 | < 0.1% | |
| 36000 | 24 | < 0.1% | |
| 34400 | 1 | < 0.1% | |
| 34000 | 5 | < 0.1% | |
| 33800 | 1 | < 0.1% |
| Distinct | 4530 |
|---|---|
| Distinct (%) | 16.3% |
| Missing | 59294 |
| Missing (%) | 68.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10999.52838 |
|---|---|
| Minimum | 1176.41 |
| Maximum | 144748.28 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 680.0 KiB |
Quantile statistics
| Minimum | 1176.41 |
|---|---|
| 5-th percentile | 3447.1 |
| Q1 | 6491.6 |
| median | 9392.97 |
| Q3 | 12919.04 |
| 95-th percentile | 25444.4125 |
| Maximum | 144748.28 |
| Range | 143571.87 |
| Interquartile range (IQR) | 6427.44 |
Descriptive statistics
| Standard deviation | 7512.32305 |
|---|---|
| Coefficient of variation (CV) | 0.6829677412 |
| Kurtosis | 16.8985671 |
| Mean | 10999.52838 |
| Median Absolute Deviation (MAD) | 3306.9 |
| Skewness | 2.754955411 |
| Sum | 304972923.8 |
| Variance | 56434997.6 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 3716.36 | 288 | 0.3% | |
| 7948.17 | 252 | 0.3% | |
| 5089.58 | 240 | 0.3% | |
| 5298.78 | 229 | 0.3% | |
| 8742.98 | 218 | 0.3% | |
| 7432.72 | 215 | 0.2% | |
| 10597.55 | 214 | 0.2% | |
| 7683.23 | 183 | 0.2% | |
| 2649.39 | 177 | 0.2% | |
| 8852.07 | 155 | 0.2% | |
| 11855.63 | 140 | 0.2% | |
| 11960.68 | 136 | 0.2% | |
| 4327.73 | 135 | 0.2% | |
| 12026.6 | 133 | 0.2% | |
| 11631.53 | 132 | 0.2% | |
| 11947.21 | 118 | 0.1% | |
| 13246.94 | 114 | 0.1% | |
| 7007.89 | 109 | 0.1% | |
| 6086.07 | 103 | 0.1% | |
| 9537.8 | 102 | 0.1% | |
| 9037.63 | 100 | 0.1% | |
| 5668.78 | 99 | 0.1% | |
| 7745.56 | 97 | 0.1% | |
| 10696.25 | 96 | 0.1% | |
| 11149.08 | 95 | 0.1% | |
| Other values (4505) | 23846 | 27.4% | |
| (Missing) | 59294 | 68.1% |
| Value | Count | Frequency (%) | |
| 1176.41 | 1 | < 0.1% | |
| 1185.56 | 5 | < 0.1% | |
| 1196.07 | 5 | < 0.1% | |
| 1202.66 | 4 | < 0.1% | |
| 1222.55 | 2 | < 0.1% | |
| 1235.92 | 1 | < 0.1% | |
| 1256.11 | 1 | < 0.1% | |
| 1269.67 | 1 | < 0.1% | |
| 1273.75 | 1 | < 0.1% | |
| 1317.75 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 144748.28 | 1 | < 0.1% | |
| 135564.48 | 2 | < 0.1% | |
| 97211.02 | 1 | < 0.1% | |
| 87489.92 | 1 | < 0.1% | |
| 79306.65 | 1 | < 0.1% | |
| 67291.72 | 1 | < 0.1% | |
| 67140.83 | 3 | < 0.1% | |
| 66696.84 | 4 | < 0.1% | |
| 66234.71 | 2 | < 0.1% | |
| 63917.81 | 1 | < 0.1% |
Filled_Form
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 680.0 KiB |
| N | |
|---|---|
| Y |
| Value | Count | Frequency (%) | |
| N | 67530 | 77.6% | |
| Y | 19490 | 22.4% |
Device_Type
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 680.0 KiB |
| Web-browser | |
|---|---|
| Mobile |
| Value | Count | Frequency (%) | |
| Web-browser | 64316 | 73.9% | |
| Mobile | 22704 | 26.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 9.695472305 |
| Min length | 6 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 151336 | 17.9% | |
| b | 151336 | 17.9% | |
| r | 128632 | 15.2% | |
| o | 87020 | 10.3% | |
| W | 64316 | 7.6% | |
| - | 64316 | 7.6% | |
| w | 64316 | 7.6% | |
| s | 64316 | 7.6% | |
| M | 22704 | 2.7% | |
| i | 22704 | 2.7% | |
| l | 22704 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 692364 | 82.1% | |
| Uppercase Letter | 87020 | 10.3% | |
| Dash Punctuation | 64316 | 7.6% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| W | 64316 | 73.9% | |
| M | 22704 | 26.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 151336 | 21.9% | |
| b | 151336 | 21.9% | |
| r | 128632 | 18.6% | |
| o | 87020 | 12.6% | |
| w | 64316 | 9.3% | |
| s | 64316 | 9.3% | |
| i | 22704 | 3.3% | |
| l | 22704 | 3.3% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 64316 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 779384 | 92.4% | |
| Common | 64316 | 7.6% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 151336 | 19.4% | |
| b | 151336 | 19.4% | |
| r | 128632 | 16.5% | |
| o | 87020 | 11.2% | |
| W | 64316 | 8.3% | |
| w | 64316 | 8.3% | |
| s | 64316 | 8.3% | |
| M | 22704 | 2.9% | |
| i | 22704 | 2.9% | |
| l | 22704 | 2.9% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| - | 64316 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 843700 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 151336 | 17.9% | |
| b | 151336 | 17.9% | |
| r | 128632 | 15.2% | |
| o | 87020 | 10.3% | |
| W | 64316 | 7.6% | |
| - | 64316 | 7.6% | |
| w | 64316 | 7.6% | |
| s | 64316 | 7.6% | |
| M | 22704 | 2.7% | |
| i | 22704 | 2.7% | |
| l | 22704 | 2.7% |
Var2
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 680.0 KiB |
| B | |
|---|---|
| G | |
| C | |
| E | 1315 |
| D | 634 |
| Other values (2) | 549 |
| Value | Count | Frequency (%) | |
| B | 37280 | 42.8% | |
| G | 33032 | 38.0% | |
| C | 14210 | 16.3% | |
| E | 1315 | 1.5% | |
| D | 634 | 0.7% | |
| F | 544 | 0.6% | |
| A | 5 | < 0.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| B | 37280 | 42.8% | |
| G | 33032 | 38.0% | |
| C | 14210 | 16.3% | |
| E | 1315 | 1.5% | |
| D | 634 | 0.7% | |
| F | 544 | 0.6% | |
| A | 5 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 87020 | 100.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| B | 37280 | 42.8% | |
| G | 33032 | 38.0% | |
| C | 14210 | 16.3% | |
| E | 1315 | 1.5% | |
| D | 634 | 0.7% | |
| F | 544 | 0.6% | |
| A | 5 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 87020 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| B | 37280 | 42.8% | |
| G | 33032 | 38.0% | |
| C | 14210 | 16.3% | |
| E | 1315 | 1.5% | |
| D | 634 | 0.7% | |
| F | 544 | 0.6% | |
| A | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 87020 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| B | 37280 | 42.8% | |
| G | 33032 | 38.0% | |
| C | 14210 | 16.3% | |
| E | 1315 | 1.5% | |
| D | 634 | 0.7% | |
| F | 544 | 0.6% | |
| A | 5 | < 0.1% |
Source
Categorical
| Distinct | 30 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 680.0 KiB |
| S122 | |
|---|---|
| S133 | |
| S159 | |
| S143 | |
| S127 | 1931 |
| Other values (25) |
| Value | Count | Frequency (%) | |
| S122 | 38567 | 44.3% | |
| S133 | 29885 | 34.3% | |
| S159 | 5599 | 6.4% | |
| S143 | 4332 | 5.0% | |
| S127 | 1931 | 2.2% | |
| S137 | 1724 | 2.0% | |
| S134 | 1301 | 1.5% | |
| S161 | 769 | 0.9% | |
| S151 | 720 | 0.8% | |
| S157 | 650 | 0.7% | |
| S153 | 494 | 0.6% | |
| S156 | 308 | 0.4% | |
| S144 | 299 | 0.3% | |
| S158 | 208 | 0.2% | |
| S123 | 73 | 0.1% | |
| S141 | 57 | 0.1% | |
| S162 | 36 | < 0.1% | |
| S124 | 24 | < 0.1% | |
| S160 | 11 | < 0.1% | |
| S150 | 10 | < 0.1% | |
| S155 | 4 | < 0.1% | |
| S138 | 3 | < 0.1% | |
| S136 | 3 | < 0.1% | |
| S129 | 3 | < 0.1% | |
| S139 | 3 | < 0.1% | |
| Other values (5) | 6 | < 0.1% |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1 | 88566 | 25.4% | |
| S | 87020 | 25.0% | |
| 2 | 79202 | 22.8% | |
| 3 | 67706 | 19.5% | |
| 5 | 8001 | 2.3% | |
| 4 | 6314 | 1.8% | |
| 9 | 5605 | 1.6% | |
| 7 | 4305 | 1.2% | |
| 6 | 1127 | 0.3% | |
| 8 | 211 | 0.1% | |
| 0 | 23 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 261060 | 75.0% | |
| Uppercase Letter | 87020 | 25.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| S | 87020 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 88566 | 33.9% | |
| 2 | 79202 | 30.3% | |
| 3 | 67706 | 25.9% | |
| 5 | 8001 | 3.1% | |
| 4 | 6314 | 2.4% | |
| 9 | 5605 | 2.1% | |
| 7 | 4305 | 1.6% | |
| 6 | 1127 | 0.4% | |
| 8 | 211 | 0.1% | |
| 0 | 23 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 261060 | 75.0% | |
| Latin | 87020 | 25.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| S | 87020 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1 | 88566 | 33.9% | |
| 2 | 79202 | 30.3% | |
| 3 | 67706 | 25.9% | |
| 5 | 8001 | 3.1% | |
| 4 | 6314 | 2.4% | |
| 9 | 5605 | 2.1% | |
| 7 | 4305 | 1.6% | |
| 6 | 1127 | 0.4% | |
| 8 | 211 | 0.1% | |
| 0 | 23 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 348080 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1 | 88566 | 25.4% | |
| S | 87020 | 25.0% | |
| 2 | 79202 | 22.8% | |
| 3 | 67706 | 19.5% | |
| 5 | 8001 | 2.3% | |
| 4 | 6314 | 1.8% | |
| 9 | 5605 | 1.6% | |
| 7 | 4305 | 1.2% | |
| 6 | 1127 | 0.3% | |
| 8 | 211 | 0.1% | |
| 0 | 23 | < 0.1% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.949804643 |
|---|---|
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 2546 |
| Zeros (%) | 2.9% |
| Memory size | 680.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 1.69771985 |
|---|---|
| Coefficient of variation (CV) | 0.5755363678 |
| Kurtosis | -0.8576087191 |
| Mean | 2.949804643 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.2211281429 |
| Sum | 256692 |
| Variance | 2.882252688 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 3 | 25260 | 29.0% | |
| 1 | 23906 | 27.5% | |
| 5 | 20266 | 23.3% | |
| 4 | 6577 | 7.6% | |
| 2 | 5931 | 6.8% | |
| 0 | 2546 | 2.9% | |
| 7 | 2302 | 2.6% | |
| 6 | 232 | 0.3% |
| Value | Count | Frequency (%) | |
| 0 | 2546 | 2.9% | |
| 1 | 23906 | 27.5% | |
| 2 | 5931 | 6.8% | |
| 3 | 25260 | 29.0% | |
| 4 | 6577 | 7.6% | |
| 5 | 20266 | 23.3% | |
| 6 | 232 | 0.3% | |
| 7 | 2302 | 2.6% |
| Value | Count | Frequency (%) | |
| 7 | 2302 | 2.6% | |
| 6 | 232 | 0.3% | |
| 5 | 20266 | 23.3% | |
| 4 | 6577 | 7.6% | |
| 3 | 25260 | 29.0% | |
| 2 | 5931 | 6.8% | |
| 1 | 23906 | 27.5% | |
| 0 | 2546 | 2.9% |
LoggedIn
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 680.0 KiB |
| 0 | |
|---|---|
| 1 | 2554 |
| Value | Count | Frequency (%) | |
| 0 | 84466 | 97.1% | |
| 1 | 2554 | 2.9% |
Disbursed
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 680.0 KiB |
| 0 | |
|---|---|
| 1 | 1273 |
| Value | Count | Frequency (%) | |
| 0 | 85747 | 98.5% | |
| 1 | 1273 | 1.5% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| ID | Gender | City | Monthly_Income | DOB | Lead_Creation_Date | Loan_Amount_Applied | Loan_Tenure_Applied | Existing_EMI | Employer_Name | Salary_Account | Mobile_Verified | Var5 | Var1 | Loan_Amount_Submitted | Loan_Tenure_Submitted | Interest_Rate | Processing_Fee | EMI_Loan_Submitted | Filled_Form | Device_Type | Var2 | Source | Var4 | LoggedIn | Disbursed | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | ID000002C20 | Female | Delhi | 20000 | 23-May-78 | 15-May-15 | 300000.0 | 5.0 | 0.0 | CYBOSOL | HDFC Bank | N | 0 | HBXX | NaN | NaN | NaN | NaN | NaN | N | Web-browser | G | S122 | 1 | 0 | 0 |
| 1 | ID000004E40 | Male | Mumbai | 35000 | 07-Oct-85 | 04-May-15 | 200000.0 | 2.0 | 0.0 | TATA CONSULTANCY SERVICES LTD (TCS) | ICICI Bank | Y | 13 | HBXA | 200000.0 | 2.0 | 13.25 | NaN | 6762.90 | N | Web-browser | G | S122 | 3 | 0 | 0 |
| 2 | ID000007H20 | Male | Panchkula | 22500 | 10-Oct-81 | 19-May-15 | 600000.0 | 4.0 | 0.0 | ALCHEMIST HOSPITALS LTD | State Bank of India | Y | 0 | HBXX | 450000.0 | 4.0 | NaN | NaN | NaN | N | Web-browser | B | S143 | 1 | 0 | 0 |
| 3 | ID000008I30 | Male | Saharsa | 35000 | 30-Nov-87 | 09-May-15 | 1000000.0 | 5.0 | 0.0 | BIHAR GOVERNMENT | State Bank of India | Y | 10 | HBXX | 920000.0 | 5.0 | NaN | NaN | NaN | N | Web-browser | B | S143 | 3 | 0 | 0 |
| 4 | ID000009J40 | Male | Bengaluru | 100000 | 17-Feb-84 | 20-May-15 | 500000.0 | 2.0 | 25000.0 | GLOBAL EDGE SOFTWARE | HDFC Bank | Y | 17 | HBXX | 500000.0 | 2.0 | NaN | NaN | NaN | N | Web-browser | B | S134 | 3 | 1 | 0 |
| 5 | ID000010K00 | Male | Bengaluru | 45000 | 21-Apr-82 | 20-May-15 | 300000.0 | 5.0 | 15000.0 | COGNIZANT TECHNOLOGY SOLUTIONS INDIA PVT LTD | HSBC | Y | 17 | HAXM | 300000.0 | 5.0 | 13.99 | 1500.0 | 6978.92 | N | Web-browser | B | S143 | 3 | 1 | 0 |
| 6 | ID000011L10 | Female | Sindhudurg | 70000 | 23-Oct-87 | 01-May-15 | 6.0 | 5.0 | 0.0 | CARNIVAL CRUISE LINE | Yes Bank | N | 0 | HBXX | NaN | NaN | NaN | NaN | NaN | N | Web-browser | B | S133 | 1 | 0 | 0 |
| 7 | ID000012M20 | Male | Bengaluru | 20000 | 25-Jul-75 | 20-May-15 | 200000.0 | 5.0 | 2597.0 | GOLDEN TULIP FLORITECH PVT. LTD | NaN | Y | 3 | HBXX | 200000.0 | 5.0 | NaN | NaN | NaN | N | Web-browser | B | S159 | 3 | 0 | 0 |
| 8 | ID000013N30 | Male | Kochi | 75000 | 26-Jan-72 | 02-May-15 | 0.0 | 0.0 | 0.0 | SIIS PVT LTD | State Bank of India | Y | 13 | HAXB | 1300000.0 | 5.0 | 14.85 | 26000.0 | 30824.65 | Y | Mobile | C | S122 | 5 | 0 | 0 |
| 9 | ID000014O40 | Female | Mumbai | 30000 | 12-Sep-89 | 03-May-15 | 300000.0 | 3.0 | 0.0 | SOUNDCLOUD.COM | Kotak Bank | Y | 0 | HBXC | 300000.0 | 3.0 | 18.25 | 1500.0 | 10883.38 | N | Web-browser | B | S133 | 1 | 0 | 0 |
Last rows
| ID | Gender | City | Monthly_Income | DOB | Lead_Creation_Date | Loan_Amount_Applied | Loan_Tenure_Applied | Existing_EMI | Employer_Name | Salary_Account | Mobile_Verified | Var5 | Var1 | Loan_Amount_Submitted | Loan_Tenure_Submitted | Interest_Rate | Processing_Fee | EMI_Loan_Submitted | Filled_Form | Device_Type | Var2 | Source | Var4 | LoggedIn | Disbursed | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 87010 | ID124806G10 | Male | Nagpur | 28000 | 10-Jun-73 | 31-Jul-15 | 0.0 | 0.0 | 0.0 | UTTAM VALUE STEEL LTD,WARDHA | Central Bank of India | Y | 2 | HAXB | 500000.0 | 4.0 | 14.85 | 10000.0 | 13877.39 | Y | Mobile | G | S122 | 5 | 0 | 0 |
| 87011 | ID124808I30 | Male | Bengaluru | 15000 | 01-Jun-90 | 31-Jul-15 | 0.0 | 0.0 | 0.0 | AIRTEL | Karnataka Bank | Y | 1 | HBXX | 240000.0 | 4.0 | NaN | NaN | NaN | N | Mobile | G | S122 | 3 | 0 | 0 |
| 87012 | ID124810K00 | Male | Bengaluru | 46000 | 02-Jan-85 | 31-Jul-15 | 300000.0 | 3.0 | 0.0 | COGNIZANT TECHNOLOGY SOLUTIONS INDIA PVT LTD | HDFC Bank | Y | 15 | HBXC | 300000.0 | 3.0 | 13.00 | 2400.0 | 10108.19 | N | Web-browser | G | S122 | 4 | 0 | 0 |
| 87013 | ID124811L10 | Male | Secunderabad | 24000 | 01-Jan-90 | 31-Jul-15 | 300000.0 | 3.0 | 0.0 | INDIAN AIR FORCE | State Bank of India | Y | 2 | HBXX | 300000.0 | 3.0 | NaN | NaN | NaN | N | Web-browser | G | S122 | 3 | 0 | 0 |
| 87014 | ID124812M20 | Female | Pune | 49000 | 31-May-82 | 31-Jul-15 | 400000.0 | 5.0 | 0.0 | INFOSYS TECHNOLOGIES | ICICI Bank | N | 14 | HBXX | NaN | NaN | NaN | NaN | NaN | N | Web-browser | G | S122 | 3 | 0 | 0 |
| 87015 | ID124813N30 | Female | Ajmer | 71901 | 27-Nov-69 | 31-Jul-15 | 1000000.0 | 5.0 | 14500.0 | MAYO COLLEGE | ICICI Bank | N | 9 | HBXX | NaN | NaN | NaN | NaN | NaN | N | Web-browser | G | S122 | 3 | 0 | 0 |
| 87016 | ID124814O40 | Female | Kochi | 16000 | 01-Dec-90 | 31-Jul-15 | 0.0 | 0.0 | 0.0 | KERALA COMMUNICATORS CABLE LTD | Federal Bank | Y | 1 | HBXB | 240000.0 | 4.0 | 35.50 | 4800.0 | 9425.76 | Y | Mobile | G | S122 | 5 | 0 | 0 |
| 87017 | ID124816Q10 | Male | Bengaluru | 118000 | 28-Jan-72 | 31-Jul-15 | 0.0 | 0.0 | 0.0 | BANGALORE INSTITUTE OF TECHNOLOGY | Syndicate Bank | Y | 8 | HBXX | 1200000.0 | 4.0 | NaN | NaN | NaN | N | Mobile | G | S122 | 3 | 0 | 0 |
| 87018 | ID124818S30 | Male | Bengaluru | 98930 | 27-Apr-77 | 31-Jul-15 | 800000.0 | 5.0 | 13660.0 | FIRSTSOURCE SOLUTION LTD | ICICI Bank | Y | 18 | HBXX | 800000.0 | 5.0 | NaN | NaN | NaN | N | Web-browser | G | S122 | 3 | 0 | 0 |
| 87019 | ID124821V10 | Male | Mumbai | 42300 | 31-Oct-88 | 31-Jul-15 | 0.0 | 0.0 | 0.0 | GOVERNMENT OF INDIA | NaN | Y | 12 | HBXA | 690000.0 | 4.0 | 13.99 | 3450.0 | 18851.81 | N | Web-browser | G | S122 | 4 | 0 | 0 |